The chapter describes the functionalities and components, which are used to perform the categorisation and data mining of a content item.
content item categorisation
The item can be added in a different categor,y based on its content. This kind of categorisation is performed manually and is based on the decision of its editor. The plugins in the system, which build this component start with
Important services:
ICiCategorizationService – provides methods, which retrieve and save the categorisation of a content item. The service unites both the manual and the automatic content item categorisation services.
ICiCategorizationTreeService – provides categorisation trees from which the categories can be picked.
ICiConcreteCategorizationService – a parent service, which wraps both the automatic and manual categorizations.
keyword categorisation
This component is used for the manual categorisation of content items. The keywords are categorisation tags, which are assigned to a content item. The plugins in the system, which build this component start with
Important services:
CiKeywordCategorizationService – provides the manually assigned categorisation tags for the item.
CiKeywordCategorizationTreeService – provides the categorisation trees, from which the tags can be picked.
automatic categorisation
The component is used for the automatic categorisation of content items. This type of classification is performed based on the item's textual content. The plugins in the system, which build this component start with
Important services:
CiAutomaticCategorizationService – provides the automatic categorisation of the content item
categorisation history
Provides information about the changes of the classification of a given content item. The plugins in the system, which build this component start with
Important services:
CiCategorizationHistoryProvider – generates the categorisation history, based on a CiCategorizationHistory object.
CiCategorizationHistoryDetailsService – provides the categorisation history of the content item.
text mining
This component allows the extraction, processing and analysing of the text from the documents of a content item.
Important services:
ICiTermsConverterService - allows the convertion of important text mining excerpts to categorisation nodes, thus allowing the classification of the content item with its own important words and phrases
ATLAS (Applied Technology for Language-Aided CMS) is a project funded by the European Commission under the CIP ICT Policy Support Programme.